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Abstract 

Background: Accurate classification of non-small cell lung cancer (NSCLC) using morphological features has several 
limitations. However, the use of thyroid transcription factor 1 (TTF-1) and Napsin A as markers for the identification of 
various subtypes of NSCLC has shown promise. This meta-analysis was designed to evaluate the diagnostic value of 
combined TTF-1 and Napsin A test to distinguish lung adenocarcinoma from squamous cell carcinoma. 

Methods: The Medline, EMBASE and Web of Science databases were searched, along with the reference lists of relevant 
articles (up to May 4, 2014). Ten studies containing 1,446 subjects were identified. The sensitivity, specificity, diagnostic odds 
ratio (DOR) and area under the summary receiver operating characteristics curve (AUC) were calculated to estimate the 
combined diagnostic value of TTF-1 and Napsin A. 

Results: The pooled sensitivity and specificity were 0.76 (95% CI: 0.69-0.83) and 1.00 (95% CI: 0.92-1.00), respectively. The 
positive and negative likelihood ratios were 877.60 (95% CI: 8.40-91533.40) and 0.24 (95% CI: 0.18-0.32). The DOR was 3719 
(95% CI: 33-414884). The AUC was 0.92 (95%CI: 0.89-0.94). The patient's location was a source of heterogeneity for 
sensitivity. The patient's location, the study's sample size and the threshold used to determine positive staining were 
consistently found to be sources of heterogeneity for specificity in subgroup analyses and meta-regression. 

Conclusions: The combined test of TTF-1 and Napsin A presents a promising alternative method, useful to distinguish 
between lung adenocarcinoma and squamous cell carcinoma. 
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Introduction 

Lung cancer is the leading cause of cancer mortality worldwide. 
Approximately eighty percent of lung cancers are determined to 
be non-small cell lung cancer (NSCLC), a heterogeneous group 
comprised of adenocarcinoma (AC) and squamous cell carcinoma 
(SQCC) [1]. Due to the prevalence of NSCLC, extensive 
researches in the field have enabled the development of various 
therapies targeting specific types of lung cancer [2]. Therefore, 
demand for the accurate classifications of different subtypes of 
NSCLC has risen to allow maximization of the subsequent 
therapeutic response and minimization of any adverse effects, 
especially for poorly differentiated non-small cell lung carcinomas, 
which were classified generally as non-small cell carcinoma not 
otherwise specified (NSC NOS) based on hematoxylin-eosin 



(H&E) immunohistochemistry method [3-5]. However, poorly 
differentiated non-small cell lung carcinomas could be further 
divided into several subtypes, mostly into adenocarcinomas (AC) 
and squamous cell carcinoma (SQCC), which require different 
treatment strategies. For example, AC frequently harbour 
activating mutations of the epidermal growth factor receptor gene 
(EGFR) or EML4-ALK rearrangements, so epidermal growth 
factor receptor (EGFR) inhibitors and vascular endothelial growth 
factor (VEGF) inhibitors are more effective in treating AC than 
SQCC [6-8]. SQCC commonly express insulin-like growth factor 
1 receptor (IGF-1R), which is a target for figitumumab. In 
addition, molecularly-targeted therapies such as Avastin should 
not be used to treat SQCC patients, as it has a 30% mortality rate, 
due to fatal hemoptysis [9,10]. Therefore, misdiagnosis or 
inaccurate diagnosis of poorly differentiated non-small cell lung 
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carcinoma by single hematoxylin-eosin (H&E) immunohistochem- 
ical method may lead to patients' insensitivity to subsequent 
therapies and maximization of adverse effects. And this will in turn 
lead to difficult management of lung cancer patients and huge 
consumption of medical and health resources. Alternatively, the 
use of new molecular biomarkers, whose identification only 
require a small amount of cells, has been proposed as a powerful 
tool to distinguish AC from SQCC [11-14]. The value of 
immunohistochemical markers has been well established in 
separating poorly differentiated AC from SQCC [12,15-17]. 
Two such markers recently showing promise are Thyroid 
transcription factor 1 (TTF-1) and Napsin A. 

TTF- 1 , a highly-conserved homeodomain-containing transcrip- 
tional factor involved in the early development of lung. It is a well 
established immunomarker for lung adenocarcinoma, with sensi- 
tivity ranging from 60% to 100%, and specificity ranging from 
97% to 100% [18-21]. Napsin A, a functional aspartic proteinase 
expressed in the cytoplasm of healthy lung parenchyma, has 
recently been reported as a promising immunomarker associated 
with lung adenocarcinomas [22,23]. And the potential of TTF-1 
and Napsin A has been revealed in separating AC from SQCC 
even in poorly differentiated NSCLC [17]. Stoll et al. reported the 
sensitivity and specificity for Napsin A immunohistochemistry in 
diagnosing poorly differentiated pulmonary ADCs were 65% and 
96%, respectively [16]. Besides the advantage in the diagnosis of 
lung adenocarcinoma, both TTF-1 and Napsin A have also been 
found in biopsies of lung squamous cell carcinoma at rates of 2% 
to 13% for TTF-1, and upwards of 26% for Napsin A [23-28]. 
This has led researchers to propose the combined test of TTF- 1 
and Napsin A in order to subclassify NSCLC [29,30]. However, 
the diagnostic accuracy of the combined test has varied among 
studies, with the specificities ranging from 0.88 to 1.00 
[22,24,27,31], and the sensitivity between 0.47 and 1.00 
[1,8,30,32]. Until now, the diagnostic value of the combined test 
has not been systematically evaluated. The present meta-analysis 
was designed to summarize the evidence behind the value of a 
combined TTF-1 and Napsin A test for subtype-classification of 
lung AC and SQCC in NSCLC. 

Materials and Methods 

Literature search 

A thorough search of Medline, EMBASE and Web of Science 
(up to May 4, 2014) was conducted to identify eligible studies. The 
following search terms were used: "lung cancer" OR "lung 
neoplasms" OR "lung neoplasm" OR "lung carcinoma" OR 
"lung tumor" OR "pulmonary neoplasm" OR "pulmonary 
neoplasms" OR "pulmonary cancer" OR "carcinoma, non-small 
cell lung" OR "non-small cell lung cancer" OR "non-small cell 
lung carcinoma" OR NSCLC, "thyroid transcription factor- 1" 
OR TTF-1, Napsin A, without language restrictions. The 
references of any relevant articles were also scanned for potentially 
missing studies. 

Selection criteria 

Selection criteria were given as follows: (1) the combined 
detection of TTF-1 and Napsin A was used to distinguish between 
AC and SQCC; (2) the combined test of TTF-1 and Napsin A was 
serial test, which mean tissues with positive immunohistochemical 
staining for both TTF1 and Napsin A were determined as 
adenocarcinomas, otherwise, as squamous cell carcinoma; (3) 
morphological diagnoses, such as hematoxylin-eosin staining, were 
used as the reference diagnostic standard; (4) results were reported 
in numbers of true-positive, false-positive, true-negative, and false- 



negative, or sufficiendy detailed data were presented to derive 
these numbers. Studies were excluded if they were as follows: (1) a 
case report, review or conference proceeding; (2) containing no 
specific results regarding lung cancer; (3) with a sample size less 
than 50. For multiple or duplicate publications that covered the 
same dataset, only the most recent or complete study was included. 

Data extraction and quality assessment 

The two investigators independently extracted data from all 
studies fulfilling the inclusion criteria. The following information 
was extracted from each study: first author, year of publication, 
patients' location, sample size, the number of AC and SQCC, 
specimen type and the threshold for staining positivity. Any 
disagreement was resolved by consensus. The quality of method- 
ology for each study was assessed using the quality assessment of 
diagnostic accuracy studies (QUAD AS) [33]. For each item, a 
score of 1 was applied if the answer was "yes"; otherwise, a score 0 
was applied. 

Statistical analysis 

For studies with true-positive, true-negative, false-positive and 
false-negative numbers available, we computed the following 
parameters and corresponding 95 % confidence intervals for each 
study: sensitivity, specificity, and the diagnostic odds ratio 
combined by positive and negative likelihood ratios (LRs). Data 
were finally pooled in summarized receiver-operating character- 
istic curves (sROC), where the area under the sROC (AUC) 
measures test precision. The heterogeneity was explored with a 
Cochran's chi-square test and quantified by calculating the I 2 
statistic to reflect the degree of variability in results across studies. 
To assess any potential confounding factors, including the patient's 
location (Western vs. Asian), specimen types (resected vs. others), 
sample size (over 100 vs. less than 100), and threshold for staining 
positivity (&1% vs. >10%), the subgroup analyses and meta- 
regression were performed taking the above factors into account. 
Additionally, sensitivity analysis was also performed to assess the 
influence of any individual study on the overall estimate. 

In order to test for publication bias, Deek's funnel plot method 
was applied. Statistical analyses were performed using Midas 
module in the Stata (Version 10.0), and all P-values calculated as 
two-sided. The association was considered significant if the P- 
value was less than 0.05. 

Results 

Characteristics and methodological quality of include 
studies 

The search of the databases produced 222 studies, of which 84 
were excluded for duplication, leaving 138 potentially relevant 
studies to be retrieved. One hundred and five studies were 
excluded following the first screening based on abstracts and tides. 
Thirty-three studies were then retrieved for full text review. After 
carefully reading the full text articles, 2 were excluded for 
duplication and 6 studies were excluded for containing a sample 
size less than 50 [1,9,10,12,29,34]. And 15 studies were excluded 
for containing insufficient data to calculate sensitivity and 
specificity. Finally, 10 studies were eligible for inclusion in this 
meta-analysis [8,11,22,24,27,31,32,35-37] (Figure 1). The char- 
acteristics of the included studies are detailed in Table 1 . 

The quality assessments on the included studies are presented in 
Figure 2. Out of 14 QUAD AS items, item 1 (spectrum 
composition) and 2 (selection criteria) describe the variability of 
the studies, while items 8 (index test execution), 9 (reference 
standard execution) and 1 3 (uninterpretable test results) assess the 



PLOS ONE | www.plosone.org 



2 



July 2014 | Volume 9 | Issue 7 | e100837 



Combined Testing of TTF-1 and Napsin A to Class NSCLC 



138 potentially relevant publications identified from online search strategy 



105 papers rejected based on title and abstracts: 

• 51 conference 

• 18 case reports 
. 5 reviews 

• 30 diagnosis for other lung lesion 
. 1 letter 



33 papers full text reviewed 



23 excluded as: 

. 2 duplicate publications 

• 15 insufficient data 

• 6 sample size less than 50 



10 papers included in the Meta-analysis 



Figure 1. Study flow chart for the process of selecting the eligible publications. 

doi:1 0.1 371 /journal.pone.01 00837.g001 



quality of the reporting. The remainders are about the bias of the 
studies. The QUAD AS scores of the studies ranged from 8 to 1 2 
with a median score of 10. Items 1 and 2 were 60.0% and 30.0% 
fulfilled by studies respectively, indicating high variability among 
studies. The execution proportions of item 8, 9 and 13 were 
60.0%, 20.0% and 40.0% respectively, suggesting poor reporting 
quality of studies. The remainders of the items reached a level of 
100%, with exception of items 5 (verification of diagnosis), and 10 
(blinding for index test results), which were only 60.0% and 50.0% 
fulfilled by the studies, respectively. 



Meta-analysis and heterogeneity 

Ten studies containing 1,446 subjects were summarized to 
assess the diagnostic accuracy of the combined test of TTF- 1 and 
Napsin A. The pooled sensitivity and specificity were 0.76 (95% 
CI: 0.69-0.83), and 1.00 (95% CI: 0.92-1.00), respectively. Forest 
plots showed relative strength of the diagnostic accuracy of the 
combined test (Figure 3). The positive and negative LRs of the 
studies were 877.60 (95% CI: 8.40-91533.40) and 0.24 (95% CI: 
0.18-0.32), respectively. The pooled diagnostic odds ratio was 
3719 (95% CI: 33-414884). The area under the sROC was 0.92 



Table 1. Characteristics of Studies Included in the Meta-analysis. 





First Author 


Year 


Country 


Sample size 


AC/SQCC 


Specimen type 


Stain positive 


Bishop 


2010 


America 


102 


50/52 


resected specimens 


==1% a 


Yang 


2010 


America 


197 


158/39 


resected specimens 


al% 


Zhang 


2010 


China 


297 


212/85 


resected specimens 


>10% b 


Yanagita 


2011 


Japan 


64 


39/25 


surgicafbiopsy cases 


2s 1% 


Fatima 


2011 


America 


59 


35/24 


FNA 


2S 1% 


Turner 


2012 


Japan 


188 


94/94 


resected specimens 


2s 1% 


Tacha 


2012 


America 


210 


1 1 5/95 


resected specimens 


£10% 


Noh 


2012 


Korea 


74 


36/38 


resected specimens 


2s 1% 


Collins 


2013 


America 


69 


35/34 


FNA 


2s10% 


Brunnstrom 


2013 


Swedish 


186 


121/65 


resected specimens 


>1% 



NOTE: 

a , cases with more than 1% of tumor cells staining were classified as positive; 

b , cases with more than 10% of tumor cells staining were classified positive; 

FNA, Fine-needle aspiration; AC, lung adenocarcinoma; SQCC, squamous cell carcinoma. 
doi:1 0.1 371 /journal.pone.01 00837.t001 



PLOS ONE | www.plosone.org 



3 



July 2014 | Volume 9 | Issue 7 | e100837 



Combined Testing of TTF-1 and Napsin A to Class NSCLC 



Representative spectrum 
Selection criteria clearly described 
Adequate reference standard 
Time between tests 
Complete verification of diagnosis 
No differential verification 
No incorporation bias 
Adequate index test description 
Adequate reference test description 
Blinding for index test results 
Blinding for reference test results 
Clinical data available as in practice 
Uninterpretabletest results reported 
Withdrawals explained 
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Figure 2. The methodological quality assessments of studies included in the meta-analysis, the vertical coordinate presents 14 
QUADAS items of the quality assessment of diagnostic accuracy studies (QUADAS); the horizontal axis presents percentages of 14 
QUADAS items fulfilled by studies included in the meta-analysis. 

doi:10.1371/journal.pone.0100837.g002 



(95% CI: 0.89-0.94), indicating high precision of the combined 
test (Figure 4). 

Significant heterogeneity was observed both in sensitivity 
(Q,= 44.33; ft=0.01; I 2 = 79.70) and specificity (Q_= 48.76; P< 
0.01; I 2 = 81.54). Subsequently, subgroup analyses and meta- 
regression were conducted to identify the sources of heterogeneity 



with respect to the patient's location, sample size, specimen type, 
and threshold of staining positivity. The results of the subgroup 
analyses showed significant heterogeneity observed for sensitivity 
in all subgroups (Table 2). The heterogeneity of specificity in 
studies conducted in Asians was not statistically significant 
(I 2 = 18.84; P= 0.30). However, it was adverse in studies conduct- 
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0.89 [0.82 - 0.94] Brunnstrom/21 03 




Collins/2013 
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0 89 [0.73 - 0.97] 


Collins/2013 


i 


Noh/2012 


_ 

I 
I 


0.47 [0.30 - 0.65] 


Noh/2012 


—4 

I 


Tacha/2012 


-■4 

I 
I 


0.70 [0.60 - 0.78) 


Tacha/2012 


-* 

i 
I 


Turner/2012 


I 

-4- 


0 64 [0.53 - 0.73] 


Turner/2012 


—m 

i 


Fatima/2011 


i 


0.74 [0.57 - 0.88] 


Fatima/2011 


■ 


Yanagita/2011 


i ™ 


0 85 [0 69 - 0.94] 


Yanagita/201 1 


-4 


Zhang/2010 




0.78 [0.72 - 0.84] 


Zhang/2010 




Yang/2010 




0.78 [0.71 - 0.84] 


Yang/2010 


4 


Bishop/2010 


i 
i 
i 


0.78 [0.64 - 0.88] 


Bishop/2010 




COMBINED 


i 

4 
i 


0 76[0 .69 - 0.83] 


COMBINED 






i 
i 


Q = 44.33. df = 9.00. p = 0.00 






i 

U ' r- 


I2 = 79.70[67.71 -91.68) 




-. 1 



SPECIFICITY (95% CI) 

1 .00 [0.94 - 1 .00] 
1 .00 [0.90 - 1 .00] 
1 .00 [0.91 - 1 .00] 
1 .00 [0.96 - 1 .00] 
0.98 [0.93 - 1 .00] 
0.88 [0.68 - 0 97] 
1.00 [0.86- 1.00] 
1 .00 [0.96 - 1 .00] 
1.00 [0.91 -1.00] 
1 .00 [0.93 - 1 .00] 

1 .00[0.92 - 1 .00) 
Q = 48.76. df = 9.00. p = 0.00 
12 = 81.54(70.91 -92.17] 



0.3 



SENSITIVr 



0.7 



Figure 3. Pooled 

doi:10.1371/journal 



sensitivities and specificities of combined test of TTF-1 and Napsin 

.pone.0100837.g003 



SPECIFICITY 

A in distinction between AC and SQCC. 
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0.5 




o.o H 1 1 

1.0 0.5 0.0 

Specificity 

Figure 4. ROC curve analyses of combined test of TTF-1 and 
Napsin A in distinction between AC from SQCC. ACU = 0.92[0.89- 
0.94], sensitivity = 0.76[0.69-0.83], specificity = 1.00[0.92-1. 00], Solid line 
presents sROC; circle presents observed data; diamond presents 
summary operating point; dash line presents 95% confidence contour; 
points dotted line presents 95% prediction contour. 
doi:10.1371/journal.pone.0100837.g004 

ed in western countries (1 —87.40; i^O.Ol). When compared to 
holistic heterogeneity of specificity (I =81.54; P<0.01), the 
heterogeneity was reduced and was not statistically significant 
for studies with large sample sizes (I 2 = 40.08; P— 0.14), similar in 
studies that only contained resected specimens as test materials 
(I 2 = 34.35; P—QA7). And the sensitivity and specificity of the 
seven studies as just resected specimens used were not significant 
different from that of the ten studies as resected specimens and 
biopsies combined (0.74 (95%CI: 0.65-0.82) vs. 0.76 (95%CI: 
0.69-0.83)), similar in specificity (1.00 (95%CI: 0.92-1.00) vs. 1.00 
(95%CI: 0.92-1.00)). It indicated that the three studies [27,32,36] 
using small biopsy or cytology specimens as the reference 
diagnostic standard were not responsible for influencing hetero- 
geneity in sensitivity and specificity. Through meta-regression, the 
source of heterogeneity for sensitivity was found to be the patient's 
location (PsO.Ol). The source of heterogeneity for specificity was 
found to be a result of the patient's location, sample size, and 
threshold of staining positivity (all with P-values of less than 0.001). 
Furthermore, sensitivity analysis showed that removing study of 



Table 2. Subgroup analysis. 



Noh [24] resulted in a wider range in 95%CI of specificity but the 
specificity was stable (Table SI). 

Finally, the funnel plots for the diagnostic value of combined 
test did not reveal any evidence of obvious asymmetry, with all P- 
values of the Egger's test greater than 0.05 (Figure SI). Therefore, 
it was determined that publication bias did not have a statistically 
significant effect in this meta-analysis. 

Discussion 

The combined test of TTF- 1 and Napsin A has been considered 
as a promising alternative tool to subclassify NSCLC in clinical 
practice. However, the sensitivity and specificity of this combined 
test has varied greatly in previous studies. The present meta- 
analysis confirmed that the diagnostic value of this combined test 
was acceptable in distinguishing AC from SQCC, with a relatively 
high AUC (0.92, 95%CI: 0.89-0.94) and specificity (1.00, 95%CI: 
0.92-1.00), and moderate sensitivity (0.76, 95%CI: 0.69-0.83). 

In recent years, the need for accurate subclassification of 
NSCLC has increased, as emerging evidence has suggested that 
the specific NSCLC subtypes will respond differendy to targeted 
therapies. The current World Health Organization classification of 
lung cancer has been based almost entirely on the results of H&E 
staining. Although H&E evaluation could provide sufficient 
information to classify NSCLC subtypes in many cases, accurate 
diagnoses may be limited in case of where: only small biopsies or 
cytology specimens are available, there is poorly differentiated 
neoplasm, or there is a marked disruption of the histological 
architecture. What's more, the concordance rates among pathol- 
ogists often vary significantly. One study showed that concordance 
rates among pathologists in subclassifying NSCLC by H&E alone 
were only 81% [38]. As a result of these limitations, several 
molecular diagnoses have been proposed to subclassify NSCLC in 
recent years, such as EGFR mutations, K-ras mutations, EML4- 
ALK fusions and miRNA profiling [11,39-42]. EGFR mutations, 
K-ras mutations and EML4-ALK fusions have been shown to be 
primarily restricted to lung adenocarcinoma. Following identifi- 
cation, specific therapies could then be used, such as the EGFR 
inhibitors Erlotinib and Gefitinib that showed the greatest benefit 
in EGFR mutation-positive tumors, which were predominandy 
adenocarcinoma. Similarly, tumors presenting EML4-ALK fusion 
proteins or fusion gene could be treated with greater success by 
Crizotinib, an inhibitor of anaplastic lymphoma kinase (ALK). 
Although these molecular diagnostic markers were precise and 
contributed to targeted therapies, most of the studies used EGFR 



Subgroup 




SEN (95% CI) 


SPE (95% CI) 


AUC 


l 2 (SEN,95%CI) 


l 2 (SPE,95%CI) 


Patient's location 


Western 


0.80 (0.72-0.86) 


1.00 (0.09-1.00) 


0.90 


70.57 (45.64-95.51) 


87.40 (78.73-96.06) 




Asian 


0.70 (0.56-0.82) 


0.99 (0.96-1.00) 


c 


85.96 (73.40-98.52) 


18.84 (0.00-100.00) 


Sample size 


Large 3 


0.77 (0.70-0.83) 


1.00 (0.82-1.00) 


0.92 


78.61 (61.77-95.46) 


40.08 (0.00-95.51) 




Small" 


0.76 (0.58-0.88) 


1.00 (0.53-1.00) 


0.94 


85.63 (72.69-98.56) 


81.97 (64.83-99.11) 


Specimen type 


Resected 


0.74 (0.65-0.82) 


1.00 (0.92-1.00) 


0.96 


84.18 (73.58-94.78) 


34.35 (0.00-90.83) 


Staining positive 


>1% d 


0.76 (0.65-0.84) 


1.00 (0.90-1.00) 


0.93 


83.88 (73.02-94.74) 


78.73 (63.35-94.11) 



NOTE: 

a , the sample size > 100; 

b , the sample size <100; 

c , the AUC can't be calculated to get a result; 

d , cases with more than 1% of tumor cells staining were classified as positive; 

SEN, sensitivity; SPE, specificity; AUC, the summary receiver operating characteristics curve; I 2 , I 2 statistic; 95%CI, 95% confidence interval. 
doi:1 0.1 371 /journal.pone.01 00837.t002 
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and K-ras mutations to classify the subtypes of lung adenocarci- 
noma. An additional factor that must be considered is that EML4- 
ALK fusions were detected in only about 5% of lung adenocar- 
cinoma, and it was mutually exclusive to other EGFR mutations 
and K-ras mutations [43]. These problems were further com- 
pounded by the fact that, molecular testing for these markers was 
complex and not easy to perform. Similar problems were met in 
miRNA profiling. MiRNA, as noncoding single-stranded RNAs 
regulating gene expression, were reported as a highly reliable 
method in distinguishing lung SQCC from AC [11,13]. However, 
miRNA profiling had high requirements for specimen preservation 
conditions as miRNA is easily degraded, which presents a 
challenge for clinical diagnoses. Compared to these markers, 
testing for TTF-1 and Napsin A not only had a high diagnostic 
value, but also was easy to perform. This would be a powerful 
alternative method to distinguish AC from SQCC in lung cancer 
diagnosis and treatment. 

Heterogeneity was statistically significant for both sensitivity and 
specificity of the combined detection of TTF-1 and Napsin A. 
Subgroup analyses showed that the stratification of the confound- 
ing factors did not remove the heterogeneity for sensitivity. For 
specificity however, the heterogeneity was not statistically signif- 
icant in studies that were conducted in Asians, had large sample 
sizes and studies in which resected specimens were used. Meta- 
regression suggested that the patient's location contributed the 
most to the heterogeneity of sensitivity, and the patient's location, 
sample size and threshold of staining positivity were the sources of 
heterogeneity for specificity. The heterogeneity among studies 
conducted in different locations may be attributed most to ethnic 
differences, which may lead to different expression levels of TTF- 1 
and Napsin A. The heterogeneity in specificity among studies with 
different sample sizes may be due to the fact that the entire 
spectrum of patients in studies with small sample sizes was not 
representative in most cases. In order to further explore influence 
of specimen types on the pool sensitivity and specificity of the 
combined test, we compared pool sensitivity and specificity of the 
rest seven studies using resected samples only with those of the ten 
studies as resected specimens and biopsies combined. The results 
indicated that the pool sensitivity and specificity of the rest seven 
studies were not significant different from that of the ten studies 
(0.74 (95%CI: 0.65-0.82) vs. 0.76 (95%CI: 0.69-0.83) and 1.00 
(95%CI: 0.92-1.00) vs. 1.00 (95%CI: 0.92-1.00)), which was 

References 

1. Brown AF, Sirohi D, FukuokaJ, Gaglc PT, Policarpio-Nicolas M, ct al. (2013) 
Tissue-preserving antibody eoektails to differentiate primary squamous cell 
carcinoma, adenocarcinoma, and small cell carcinoma of lung. Arch Pathol Lab 
Med 137:1274-1281. 

2. Savas P, Hughes B, Solomon B (2013) Targeted therapy in lung cancer: IPASS 
and beyond, keeping abreast of the explosion of targeted therapies for lung 
cancer. J Thorac Dis 5:S579-S592. 

3. Noh S, Shim H (2012) Optimal combination of immunohistochemical markers 
for subclassification of non-small cell lung carcinomas: A tissue mieroarray study 
of poorly differentiated areas. Lung Cancer 76:51-55. 

4. Righi L, Graziano P, Fornari A, Rossi G, Barbarcschi M, et al. (2011) 
Immunohistochemical subtyping of nonsmall cell lung cancer not otherwise 
specified in line-needle aspiration cytology: A retrospective study of 103 cases 
with surgical correlation. Cancer 117:3416-3423. 

5. Mukhopadhyay S, Katzenstein AL (201 1) Subclassification of non-small cell lung 
carcinomas lacking morphologic differentiation on biopsy specimens: Utility of 
an immunohistochemical panel containing TTF-1, napsin A, p63, and CK5/6. 
Am J Surg Pathol 35:15-25. 

6. Cohen MH, Gootcnbcrg J, Kecgan P, Pazdur R (2007) FDA drug approval 
summary: bevacizumab (Avastin) plus Carboplatin and Paclitaxel as first-line 
treatment of advanced/ metastatic recurrent nonsquamous non-small cell lung 
cancer. Oncologist 12:713-718. 

7. Fauma N, Cohen C, Siddiqui MT (2011) DOG1 utility in diagnosing 
gastrointestinal stromal tumors on fine-needle aspiration. Cancer Cytopathol 
119:202-208. 



consistently found not to be a source of heterogeneity for sensitivity 
and specificity in meta-regression analysis. Sensitivity analysis 
showed that the heterogeneity was not statistically significant until 
the Fatima (201 1) study was excluded, supporting the notion that 
the Fatima (201 1) study affected the heterogeneity of specificity the 
most. This was due to the study's poorly differentiated areas of AC 
and SQCC. 

Although the study provides a comprehensive assessment on the 
diagnostic value of the combined test of TTF- 1 and Napsin A, 
several precautions should be taken into account when interpret- 
ing the results. First, significant inter-study heterogeneity was 
found. Second, the estimates that we obtained were not adjusted to 
account for other variables such as tumor size, histologic 
differentiation, and clinical staging. However, we still observed 
trends supporting the high diagnostic value of the test in patients 
with well- and moderately-differentiated lung adenocarcinoma in 
their tissue sections. Third, population characteristics such as the 
smoking, sex and age should be put forward in the included 
studies, so that a more detailed subgroup analyses can be made to 
explain the heterogeneity. 

Overall, our meta-analysis showed that the combined test of 
TTF-1 and Napsin A was a high diagnosis accuracy alternative 
diagnostic test in classifying AC and SQCC in NSCLC. 

Supporting Information 

Figure SI Deeks' funnel Plot Asymmetry test of com- 
bined TTF-1 and Napsin A in distinction between AC 
from SQCC. Circle presents study; solid line presents regression 
line. See Figure Sl.tif file. 
(TIF) 

Table SI Sensitive analysis for studies included in the meta- 
analysis. See Table SI. doc file. 
(DOC) 

Checklist SI PRISMA 2009 Checklist. 

(DOC) 

Author Contributions 

Conceived and designed the experiments: L. Li SW L. Liu. Performed the 
experiments: L. Li XLJY. Analyzed the data: L. Li SW. Wrote the paper: 
L. Li SW L. Liu. Review, and/ or revision of the manuscript: SW L. Liu. 



8. Herbst RS (2008) Bevacizumab/ chemotherapy in non-small-cell lung cancer: 
looking for a few good men? Clin Lung Cancer 9:75-76. 

9. Sandler A (2007) Bevacizumab in non small cell lung cancer. Clin Cancer Res 
13:s4613-s4616. 

10. Herbst RS, O'Neill VJ, Fehrenbacher L, Bclan CP, Bonomi PD, et al. (2007) 
Phase II study of efficacy and safety of bevacizumab in combination with 
chemotherapy or crlotinib compared with chemotherapy alone for treatment of 
recurrent or refractory non small-cell lung cancer. J Clin Oncol 25:4743-4750. 

11. Bishop JA, Benjamin H, Cholakh H, Chajut A, Clark DP, ct al. (2013) Accurate 
classification of non-small cell lung carcinoma using a novel mieroRNA-bascd 
approach. Clin Cancer Res 16:610-619. 

12. Roh MH, Schmidt L, Placido J, Farmcn S, Fields KL, et al. (2012) The 
application and diagnostic utility of immunocytoehemistry on direct smears in 
the diagnosis of pulmonary adenocarcinoma and squamous cell carcinoma. 
Diagn Cytopathol 40:949-955. 

13. Lebanony D, Benjamin H, Gilad S, Ezagouri M, Dov A, et al. (2013) Diagnostic 
assay based on hsa-miR-205 expression distinguishes squamous from non- 
squamous non-small-cell lung carcinoma. J Clin Oncol 27:2030-2037. 

14. Lee HJ, Kim YT, Kang CH, Zhao B, Tan Y, ct al. (2013) Epidermal growth 
factor receptor mutation in lung adenocarcinomas: relationship with CT 
characteristics and histologic subtypes. Radiology 268:254—264. 

15. Loo PS, Thomas SC, Nicolson MC, Fyfe MN, Kerr KM (2010) Subtyping of 
undifferentiated non-small cell carcinomas in bronchial biopsy specimens. 
J Thorac Oncol 5:442-447. 



PLOS ONE | www.plosone.org 



6 



July 2014 | Volume 9 | Issue 7 | e100837 



Combined Testing of TTF-1 and Napsin A to Class NSCLC 



16. Stoll LM, Johnson MW, Gabrielson E, Askin F, Clark DP, et al. (2010) The 
utility of napsin-A in the identification of primary and metastatic lung 
adenocarcinoma among cytologically poorly differentiated carcinomas. Cancer 
Cytopathol 118:441^49. 

17. Tsuta K, Tanabe Y, Yoshida A, Takahashi F, Maeshima AM, ct al. (2011) 
Utility of 10 immunohistochemical markers including novel markers (desmo- 
collin-3, glypican 3, S100A2, S100A7, and Sox-2) for differential diagnosis of 
squamous cell carcinoma from adenocarcinoma of the Lung. J Thorae Oncol 
6:1190-1199. 

18. Johansson L (2004) Histopathologic classification of lung cancer: Relevance of 
eytokeratin and TTF-1 immunophenotyping. Ann Diagn Pathol 8:259—267. 

19. Whithaus K, FukuokaJ, Prihoda TJJagirdarJ (2012) Evaluation of napsin A, 
eytokeratin 5/6, p63, and thyroid transcription factor 1 in adenocarcinoma 
versus squamous cell carcinoma of the lung. Arch Pathol Lab Med 136:155-162. 

20. Schultz H, Marwitz S, Baron-Luhr B, Zissel G, Kugler C, et al. (2011) 
Generation and evaluation of a monoclonal antibody, designated MAdL, as a 
new specific marker for adenocarcinomas of the lung. BrJ Cancer 105:673-681. 

21. Gu K, Shah V, Ma C, Zhang L, Yang M (2007) Cytoplasmic immunorc activity 
of thyroid transcription factor- 1 (clone 8G7G3/1) in hepatoeytes: true positivity 
or cross-reaction? Am J Clin Pathol 128:382-388. 

22. Turner BM, Cagle PT, Sainz IM, FukuokaJ, Shen SS, ct al. (2012) Napsin A, a 
new marker for lung adenocarcinoma, is complementary and more sensitive and 
specific than thyroid transcription factor 1 in the differential diagnosis of primary 
pulmonary carcinoma: evaluation of 1674 cases by tissue microarray. Arch 
Pathol Lab Med 136:163-171. 

23. Ueno T, Lindcr S (2003) Elmbcrger G. Aspartic proteinase napsin is a useful 
marker for diagnosis of primary lung adenocarcinoma. BrJ Cancer 88:1229- 
1233. 

24. Tacha D, Yu C, Bremer R, Qj W, Haas T (2012) A 6-antibody panel for the 
classification of lung adenocarcinoma versus squamous cell carcinoma. Appl 
Immunohistochem Mol Morphol 20:201-207. 

25. Kim MJ, Shin HC, Shin KC, Ro JY (2013) Best immunohistochemical panel in 
distinguishing adenocarcinoma from squamous cell carcinoma of lung: tissue 
microarray assay in resected lung cancer specimens. Ann Diagn Pathol 17:85— 
90. 

26. Stahlman MT, Gray ME, Whitsett JA (1996) Expression of thyroid transcription 
factor- 1 (TTF-1) in fetal and neonatal human lung. J Histochcm Cytochem 
44:673-678. 

27. Fatima N, Cohen C, Lawson D, Siddiqui MT (2011) TTF-1 and Napsin A 
double stain: a useful marker for diagnosing lung adenocarcinoma on fme-ncedle 
aspiration cell blocks. Cancer Cytopathol 119:127-133. 

28. Percira TC, Share SM, Magalhaes AV, Silverman JF (201 1) Can we tcU the site 
of origin of metastatic squamous cell carcinoma? An immunohistochemical 
tissue microarray study of 194 cases. Appl Immunohistochem Mol Morphol 
19:10-14. 

29. Sethi S, Geng L, Shidham VB, Archuletta P, Bandyophadhyay S, ct al. (2012) 
Dual color multiplex TTF-1 -I- Napsin A and p63 -I- CK5 immunostaining for 
subcategorizing of poorly differentiated pulmonary non-small carcinomas into 
adenocarcinoma and squamous cell carcinoma in fine needle aspiration 
specimens. Cytojournal 9:10. 



30. Johnson H, Cohen C, Fatima N, Duncan D, Siddiqui MT (2012) Thyroid 
transcription factor 1 and napsin a double stain: utilizing different vendor 
antibodies for diagnosing lung adenocarcinoma. Acta Cytol 56:596-602. 

31. Yang M, Nonaka D (2010) A study of immunohistochemical differential 
expression in pulmonary and mammary carcinomas. Mod Pathol 23:654-661. 

32. Collins BT (2013) Endobronchial ultrasound fmc-nccdle aspiration biopsy of 
pulmonary non-small cell carcinoma with subclassification by immunohisto- 
chemistry panel. Cancer Cytopathol 121:146-154. 

33. Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J (2003) The 
development of QUADAS: a tool for the quality assessment of studies of 
diagnostic accuracy included in systematic reviews. BMC Med Res Methodol 
3:25. 

34. Aikawa E, Kawahara A, Hattori S, Yamaguchi T, Abe H, ct al. (2011) 
Comparison of the expression levels of napsin A, thyroid transcription factor- 1, 
and p63 in nonsmall cell lung cancer using cytoecntrifuged bronchial brushings. 
Cancer Cytopathol 119:335-345. 

35. Brunnstrom H, Johansson L, Jirstrom K, Jonsson M, Jonsson P, ct al. (2013) 
Immunohistochcmistry in the differential diagnostics of primary lung cancer: an 
investigation within the southern Swedish lung cancer study. Am J Clin Pathol 
140:37-46. 

36. Yanagita E, Imagawa N, Ohbayashi C, Itoh T (201 1) Rapid multiplex 
immunohistochcmistry using the 4-antibody cocktail YANA-4 in differentiating 
primary adenocarcinoma from squamous cell carcinoma of the lung. Appl 
Immunohistochem Mol Morphol 19:509-513. 

37. Zhang P, Han YP, Huang L, Li Q, Ma DL (2010) Value of napsin A and thyroid 
transcription factor- 1 in the identification of primary lung adenocarcinoma. 
Oncol Lett 1:899-903. 

38. Stang A, Pohlabeln H, Muller KM, Jahn I, Giersiepen K, ct al. (2006) 
Diagnostic agreement in the histopathologieal evaluation of lung cancer tissue in 
a population-based case-control study. Lung Cancer 52:29-36. 

39. Li AR, Chitale D, Riely GJ, Pao W, Miller VA, ct al. (2008) EGFR mutations in 
lung adenocarcinomas: clinical testing experience and relationship to EGFR 
gene copy number and immunohistochemical expression. J Mol Diagn 10:242— 
248. 

40. Yoshida A, Tsuta K, Nakamura H, Kohno T, Takahashi F, et al. (2011) 
Comprehensive histologic analysis of ALK-rcarranged lung carcinomas. 
AmJ Surg Pathol 35:1226-1234. 

41. Nicholson AG, Gonzalez D, Shah P, Pynegar MJ, Deshmukh M, et al. (2010) 
Refining the diagnosis and EGFR status of non-small cell lung carcinoma in 
biopsy and cytologic material, using a panel of mucin staining, TTF- 1 , 
eytokeratin 5/6, and P63, and EGFR mutation analysis. J Thorae Oncol 5:436- 
441. 

42. Hamamoto J, Soejima K, Yoda S, Naoki K, Nakayama S, et al. (2013) 
Identification of microRNAs differentially expressed between lung squamous cell 
carcinoma and lung adenocarcinoma. Mol Med Rep 8:456-462. 

43. Sakai Y, Nakai T, Ohbayashi C, Imagawa N, Yanagita E, ct al. (2013) 
Immunohistochemical profiling of ALK fusion gene-positive adenocarcinomas 
of the lung. Int J Surg Pathol 21:476-482. 



PLOS ONE | www.plosone.org 



7 



July 2014 | Volume 9 | Issue 7 | e100837 



